WHAT IS CLAIMED IS: 



1 1. A method, comprising: 

2 receiving an identifier of a first participant of a plurality of participants of a 

3 multi-party conference; 

4 receiving a first signal indicating that the first participant is providing 

5 information to the multi-party conference; and 

6 displaying a first indication that the first participant is providing 

7 information to the multi-party conference in response to the received first signal. 

1 2. The method of claim 1, further comprising terminating the first 

2 signal when the first participant is no longer providing information to the multi-party 

3 conference. 

1 3. The method of claim 2, further comprising terminating display of the 

2 first indication in response to termination of the first signal. 

1 4. The method of claim 1, further comprising: 

2 while displaying the first indication, displaying an identifier of a second 

3 participant of the plurality of participants without indicating that the second participant is 

4 providing information to the multi-party conference. 

1 5. The method of claim 1, wherein the first indication comprises 

2 displaying the identifier so as to flash. 
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1 6. The method of claim 1, wherein the first indication comprises 

2 displaying the identifier using a preselected color scheme. 



1 7. The method of claim 1, wherein the first indication comprises 

2 displaying the identifier as an animated graphic. 

1 8. The method of claim 1, wherein the first indication comprises 

2 displaying the identifier using video. 

1 9. The method of claim 1, wherein the first indication comprises 

2 displaying the identifier as a photograph. 

1 10. The method of claim 1, wherein the first indication comprises 

2 displaying the identifier with a graphic proximate to the identifier of the first participant. 

1 11. The method of claim 10 5 wherein the graphic proximate to the 

2 identifier of the fir st participant comprises text. 

1 12. The method of claim 1, wherein the first indication comprises 

2 displaying the identifier at a position within a list of identifiers of the plurality of 

3 participants designated for a participant providing information to the multi-party 

4 conference. 



36 



1 13. The method of claim 1, wherein the first indication comprises 

2 displaying the identifier of the first participant in an area of a display designated for a 

3 participant providing information to the multi-party conference. 

1 14. The method of claim 1, wherein the identifier of the first participant 

2 comprises a location associated with the first participant. 

1 15. The method of claim 1, wherein the first signal is received when the 

2 first participant is speaking over an audio link. 

1 16. The method of claim 15, wherein the audio link comprises a 

2 telephone link. 

1 17. The method of claim 15, wherein the audio link comprises a satellite 

2 link. 

1 18. The method of claim 15, wherein the audio link comprises a voice 

2 over Internet Protocol (VoIP) link. 

1 19. The method of claim 1, further comprising receiving information 

2 from a second participant of the plurality of participants via a network while the fust 

3 participant is providing information to the multi-party conference. 
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1 20. The method of claim 19, wherein receiving information from the 

2 second participant further comprises receiving information via client-server 

3 communication over the network. 

1 21. The method of claim 19, wherein receiving information from the 

2 second participant further comprises receiving information via peer-to-peer 

3 communication over the network. 

1 22. The method of claim 19, wherein the network comprises the 

2 Internet. 

1 23. The method of claim 19, wherein the network comprises a local area 

2 network. 

1 24. The method of claim 19, wherein the network comprises a wireless 

2 network. 

1 25. The method of claim 1, further comprising: 

2 concurrently with the first indication, displaying a second indication that a 

3 second participant of the plurality of participants is providing information to the multi- 

4 party conference in response to receiving a second signal concurrently with the first 

5 signal, the second signal indicating that the second participant is providing information to 

6 the multi-party conference. 
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1 26. The method of claim 1, further comprising suppressing an indication 

2 that a second participant of the plurality of participants in response to receiving a signal 

3 concurrently indicating that the other providing information to the multi-party 

4 conference. 

1 27. A computer-accessible medium having computer-executable 

2 instructions to perform operations comprising: 

3 receiving an identifier of a first participant of a plurality of participants of a 

4 multi-party conference; 

5 receiving a first signal indicating that the first participant is providing 

6 information to the multi-party conference; and 

7 displaying a first indication that the first participant is providing 

8 information to the multi-party conference in response to the received first signal. 

1 28. The computer-accessible medium of claim 27, wherein the 

2 operations further comprise: 

3 while displaying the first indication, displaying an identifier of a second 

4 participant of the plurality of participants without indicating that the second participant is 

5 providing information to the multi-party conference. 

1 29. The computer-accessible medium of claim 27, wherein the first 

2 indication comprises displaying the identifier so as to flash. 
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1 30. The computer-accessible medium of claim 27, wherein the first 

2 indication comprises displaying the identifier using a preselected color scheme. 

1 31. The computer-accessible medium of claim 27, wherein the first 

2 indication comprises displaying the identifier as an animated graphic. 

1 32. The computer-accessible medium of claim 27, wherein the first 

2 indication comprises displaying the identifier using video. 

1 33. The computer-accessible medium of claim 27, wherein the first 

2 indication comprises displaying the identifier as a photograph. 

1 34. The computer-accessible medium of claim 27, wherein the first 

2 indication comprises displaying the identifier with a graphic proximate to the identifier of 

3 the first participant. 

1 35. The computer-accessible medium of claim 34, wherein the graphic 

2 proximate to the identifier of the first participant comprises text. 

1 36. The computer-accessible medium of claim 27, wherein the first 

2 indication comprises displaying the identifier at a position within a list of identifiers of 

3 the plurality of participants designated for a participant providing information to the 

4 multi-party conference. 
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1 37. The computer-accessible medium of claim 27, wherein the fust 

2 indication comprises displaying the identifier of the first participant in an area of a 

3 display designated for a participant providing information to the multi-party conference. 

1 38. The computer-accessible medium of claim 27, wherein the first 

2 signal is received when the first participant is speaking over an audio link. 

1 39. The computer-accessible medium of claim 27, wherein the 

2 operations further comprise: 

3 concurrently with the first indication, displaying a second indication that a 



4 second participant of the plurality of participants is providing information to the multi- 

5 party conference in response to receiving a second signal concurrently with the first 

6 signal, the second signal indicating that the second participant is providing information to 

7 the multi-party conference. 



1 40. The computer-accessible medium of claim 27 wherein the operations 

2 further comprise: 

3 suppressing an indication that a second participant of the plurality of 

4 participants in response to receiving a signal concurrently indicating that the other 

5 providing information to the multi-party conference. 

1 41. A system for multi-party conferencing for a plurality of participants, 

2 the system comprising: 

3 a user interface (UI); and 
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4 an information source indicator to cause the UI to display an identifier 

5 corresponding to a participant in response to receiving a signal that indicates that the 

6 participant is providing information to the multi-party conference, wherein the displayed 

7 identifier has a property that indicates that the participant is providing information to the 

8 multi-party conference. 

1 42. The system of claim 41, wherein the signal terminates in response to 

2 the participant no longer providing information to the multi-party conference. 

1 43. The system of claim 42, wherein the information source indicator is 

2 to cause the UI to display the identifier without the property in response to the signal 

3 terminating. 

1 44. The system of claim 41, wherein the information source indicator is 



2 further to cause the UI to display a plurality of identifiers corresponding to the plurality 

3 of participants without the property that indicates that the plurality of corresponding 

4 participants are providing information to the multi-party conference. 



1 45. The system of claim 41, wherein the information provided by the 

2 participant is the participant's speech over an audio link. 

1 46. The system of claim 41, wherein the UI is to selectively display 

2 additional information received from the network while the participant is providing 

3 information to the multi-party conference. 
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1 47. A method, comprising: 

2 detecting an acoustic signal; 

3 determining whether the detected acoustic signal was generated by a person 

4 speaking; and 

5 providing a signal indicating that the detected acoustic signal was generated 

6 by the person. 

1 48. The method of claim 47, wherein determining whether the detected 

2 acoustic signal was generated by a person further comprises: 

3 determining whether the detected acoustic signal was speech from an audio 

4 transducer device. 

1 49. The method of claim 47, wherein the signal is transmitted over a 

2 network to participants of a multi-party conference. 

1 50. The method of claim 47 wherein determining whether the detected 

2 acoustic signal was generated by a person further comprises: 

3 receiving a frame of audio data derived from the detected acoustic signal; 

4 classifying the received frame based on spectral data of the received frame; 

5 and 

6 determining a source of a portion of the detected acoustic signal used to 

7 derive the frame based on the classification of the frame and a prior determination of a 

8 source of a detected acoustic signal portion. 
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1 51. The method of claim 50, wherein the spectral data is obtained by 

2 performing a frequency transform on the frame of audio data. 

1 52. The method of claim 5 1, wherein the spectral data is obtained by 

2 performing a fast Fourier transform (FFT) on the frame of audio data. 

1 53. The method of claim 5 1, wherein the spectral data is obtained by 

2 performing a modulated complex lapped transform (MCLT) on the frame of audio data. 

1 54. The method of claim 50, further comprising detennining a first 

2 frequency band's energy and a second frequency band's energy from the spectral data. 

1 55. The method of claim 54, wherein the first frequency band 

2 corresponds to a frequency range for consonants and the second frequency band 

3 corresponds to a frequency range for vowels. 

1 56. The method of claim 55, further comprising classifying the frame as 

2 being generated by a person when the ratio of the energies of the first and second 

3 frequency bands exceeds a first predetermined threshold. 

1 57. The method of claim 56, further comprising selectively classifying 

2 the frame as being from a different source when the ratio of the energies of the first and 

3 second frequency bands is below a second predetermined threshold. 
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1 58. The method of claim 57, further comprising selectively classifying 

2 the frame as having an unknown source when the ratio of the energies of the first and 

3 second frequency bands exceeds the second predetennined threshold and is below the 

4 first predetermined threshold, the second predetermined threshold being less than the first 

5 predetennined threshold. 



1 59. The method of claim 55, further comprising determining whether the 

2 frame was derived from speech, wherein speech includes acoustic signals generated by a 

3 person spealdng and acoustic signals generated by an audio transducer device. 

1 60. The method of claim 59, further comprising detennining noise floor 



2 energies of the first and second frequency bands using the spectral data, wherein a frame 

3 is selectively classified as being derived from speech in response to the energy of the first 

4 frequency band exceeding the noise floor energy of the first frequency band or the energy 

5 of the second frequency band exceeding the noise floor of the second frequency band, or 

6 both. 

1 61. The method of claim 60, wherein classifying the received frame 

2 further comprises: 

3 detennining whether a frame received within a predetennined number of 

4 frames relative to the received frame has substantially all of its energy in the second 

5 frequency band. 
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1 62. The method of claim 60, wherein classifying the received frame 

2 further comprises: 

3 determining whether a frame adjacent to the received frame was classified 

4 as derived from speech. 

1 63. The method of claim 50, wherein classifying the received frame 



2 further comprises classifying the received frame as one of the plurality of predetermined 

3 frame types comprising a live-type frame; a phone-type frame; and an unsure-type frame, 

4 wherein live-type frames represent frames determined to be derived from acoustic signals 

5 generated by a person speaking, and phone-type frames represent frames determined to 

6 be derived from acoustic signals generated by an audio transducer device. 



1 64. The method of claim 63, further comprising detennining the source 

2 of the detected acoustic signal to be an acoustic signal generated by a person, if: 

3 the prior determination of a source of a detected acoustic signal portion is 

4 that the source was an audio transducer device; 

5 the frame is classified as a live-type frame; and 

6 a predetermined number of prior frames includes live-type frames that 

7 exceed a predetermined live-type frame count threshold. 



1 65. The method of claim 63 , further comprising determining the source 

2 of the detected acoustic signal to be unsure, if: 

3 the prior determination of a source of a detected acoustic signal portion is 

4 that the source was an audio transducer device; 

5 the frame is classified as a live-type frame; 

6 a predetermined number of most recent frames do not include enough live- 

7 type frames to exceed a predetermined threshold; and 

8 an elapsed time since receiving a previous frame derived from speech 

9 exceeds a predetermined first time threshold. 

1 66. The method of claim 63, further comprising determining the source 

2 of the detected acoustic signal to be an audio transducer device, if: 

3 the prior determination of a source of a detected acoustic signal portion is 

4 that the source was an acoustic signal generated by a person speaking; 

5 the frame is classified as a phone-type frame; 

6 an elapsed time since receiving a previous live-type frame exceeds a 

7 predetermined second time threshold; and 

8 a counter value does not exceed a predetermined count threshold, the 

9 counter value to track a number of consecutive non-live-type frames received after 

10 receiving a live-type frame of most recent frames do not include enough live-type frames 

1 1 to exceed a predetermined threshold. 
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1 67. The method of claim 63, further comprising determining the source 

2 of the detected acoustic signal to be unsure, if: 

3 the prior determination of a source of a detected acoustic signal portion is 

4 that the source was an acoustic signal generated by a person speaking; 

5 the frame is classified as a phone-type frame; 

6 an elapsed time since receiving a previous live-type frame exceeds a 

7 predetermined second time threshold; and 

8 the counter value is below a predetermined count threshold, the counter 



9 value to track a number of consecutive non-live-type frames received after receiving a 

10 live-type frame of most recent frames do not include enough live-type frames to exceed a 

1 1 predetermined threshold. 



1 68. The method of claim 63, further comprising determining the source 

2 of the detected acoustic signal to be an acoustic signal generated by a person speaking, if: 

3 the prior determination of a source of a detected acoustic signal portion is 

4 unsure; 

5 the frame is classified as a live-type frame; and 

6 a predetermined number of most recent prior frames includes live-type 

7 frames that exceed in number a predetermined live-type fr ame count threshold. 



1 69. The method of claim 63, further comprising determining the source 

2 of the detected acoustic signal to be an acoustic transducer device, if: 

3 the prior deteimination of a source of a detected acoustic signal portion is 

4 unsure; 

5 the frame is classified as a phone-type frame; and 

6 a predetermined number of most recent prior fr ames includes phone-type 

7 frames that exceed in number a predetermined phone-type frame count threshold. 

1 70. A computer-accessible medium having computer-executable 

2 instructions to perform operations comprising: 

3 detecting an acoustic signal; 

4 determining whether the detected acoustic signal was generated by a person 

5 speaking; and 

6 providing a signal indicating that the detected acoustic signal was generated 

7 by the person. 

1 71. The computer-accessible medium of claim 70, wherein determining 

2 whether the detected acoustic signal was generated by a person further comprises: 

3 determining whether the detected acoustic signal was speech from an audio 

4 transducer device. 

1 72. The computer-accessible medium of claim 70, wherein the signal is 

2 transmitted over a network to participants of a multi-party conference. 
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1 73. The computer-accessible medium of claim 70, wherein deteimiiiing 

2 whether the detected acoustic signal was generated by a person further comprises: 

3 receiving a frame of audio data derived from the detected acoustic signal; 

4 classifying the received frame based on spectral data of the received frame; 

5 and 

6 determining a source of a portion of the detected acoustic signal used to 

7 derive the frame based on the classification of the frame and a prior determination of a 

8 source of a detected acoustic signal portion. 

1 74. The computer-accessible medium of claim 73, wherein the spectral 

2 data is obtained by performing a frequency transform on the frame of audio data. 

1 75. The computer-accessible medium of claim 73, wherein the 

2 operations further comprise determining a first frequency band's energy and a second 

3 frequency band's energy from the spectral data. 

1 76. The computer-accessible medium of claim 75, wherein the first 

2 frequency band corresponds to a frequency range for consonants and the second 

3 frequency band corresponds to a frequency range for vowels. 

1 77. The computer-accessible medium of claim 76, wherein the 

2 operations further comprise: 

3 classifying the frame as being generated by a person when the ratio of the 

4 energies of the first and second frequency bands exceeds a first predetennined threshold. 
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1 78. The computer-accessible medium of claim 77, wherein the 

2 operations further comprise: 

3 selectively classifying the frame as being from another source when the 

4 ratio of the energies of the first and second frequency bands is below a second 

5 predetermined threshold. 

1 79. The computer-accessible medium of claim 78, wherein the 

2 operations further comprise: 

3 selectively classifying the frame as having an unknown source when the 



4 ratio of the energies of the first and second frequency bands exceeds the second 

5 predetermined threshold and is below the first predetermined threshold, the second 

6 predetermined threshold being less than the first predetermined threshold. 



1 80. The computer-accessible medium of claim 76, wherein the 

2 operations further comprise: 

3 determining whether the frame was derived from speech, wherein speech 

4 includes acoustic signals generated by a person speaking and acoustic signals generated 

5 by an audio transducer device. 

1 81. The computer-accessible medium of claim 80, wherein the 

2 operations further comprise: 

3 determining noise floor energies of the first and second frequency bands 

4 using the spectral data, wherein a frame is selectively classified as being derived from 
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speech in response to the energy of the first frequency band exceeding the noise floor 
energy of the first frequency band or the energy of the second frequency band exceeding 
the noise floor of the second frequency band, or both. 

82. The computer-accessible medium of claim 8 1, wherein classifying 
the received frame further comprises: 

detennining whether a frame adjacent to the received frame was classified 
as derived from speech. 

1 83. The computer-accessible medium of claim 81, wherein classifying 

2 the received frame further comprises: 

3 determining whether a frame within a predetermined number of frames 

4 relative to the received frame has substantially all of its energy in the second frequency 

5 band. 

1 84. The computer-accessible medium of claim 74, wherein classifying 

2 the received frame further comprises: 

3 classifying the received frame as one of the plurality of predetermined 

4 frame types comprising a live-type frame; a phone-type frame; and an unsure-type frame, 

5 wherein live-type frames represent frames determined to be derived from acoustic signals 

6 generated by a person speaking, and phone-type frames represent frames determined to 

7 be derived from acoustic signals generated by an audio transducer device. 



1 85. The computer-accessible medium of claim 84, wherein the 

2 operations further comprise determining the source of the detected acoustic signal to be 

3 an acoustic signal generated by a person, if: 

4 the prior determination of a source of a detected acoustic signal portion is 

5 that the source was an audio transducer device; 

6 the frame is classified as a live-type frame; and 

7 a predetermined number of prior frames includes live-type frames that 

8 exceed a predeteraiined live-type frame count threshold. 

1 86. The computer-accessible medium of claim 84 ? wherein the 

2 operations further comprise deterauning the source of the detected acoustic signal to be 

3 unsure, if: 

4 the prior determination of a source of a detected acoustic signal portion is 

5 that the source was an audio transducer device; 

6 the frame is classified as a live-type frame; 

7 a predetermined number of most recent frames do not include enough live- 

8 type frames to exceed a predetermined threshold; and 

9 an elapsed time since receiving a previous frame derived from speech 
10 exceeds a predetermined first time threshold. 
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1 87. The computer-accessible medium of claim 84, wherein the 

2 operations further comprise detenmning the source of the detected acoustic signal to be 

3 an audio transducer device, if: 

4 the prior determination of a source of a detected acoustic signal portion is 

5 that the source was an acoustic signal generated by a person speaking; 

6 the frame is classified as a phone-type frame; 

7 an elapsed time since receiving a previous live-type frame exceeds a 

8 predetermined second time thr eshold; and 

9 a counter value does not exceed a predetermined count threshold, the 



10 counter value to track a number of consecutive non-live-type frames received after 

11 receiving a live-type frame of most recent frames do not include enough live-type frames 

12 to exceed a predetermined threshold. 



1 88. The computer-accessible medium of claim 84, wherein the 

2 operations further comprise detennining the source of the detected acoustic signal to be 

3 unsure, if: 

4 the prior determination of a source of a detected acoustic signal portion is 

5 that the source was an acoustic signal generated by a person speaking; 

6 the frame is classified as a phone-type frame; 

7 an elapsed time since receiving a previous live-type frame exceeds a 

8 predetermined second time threshold; and 

9 the counter value is below a predeteimined count threshold, the counter 
10 value to track a number of consecutive non-live-type frames received after receiving a 
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live-type frame of most recent frames do not include enough live-type frames to exceed a 
predetermined threshold. 



1 89. The computer-accessible medium of claim 84, wherein the 

2 operations further comprise determining the source of the detected acoustic signal to be 

3 an acoustic signal generated by a person speaking, if: 

4 the prior detennination of a source of a detected acoustic signal portion is 

5 unsure; 

6 the frame is classified as a live-type frame; and 

7 a predetermined number of most recent prior frames includes live-type 

8 frames that exceed in number a predetermined live-type frame count threshold. 

1 90. The computer-accessible medium of claim 84, wherein the 

2 operations further comprise determining the source of the detected acoustic signal to be 

3 an acoustic transducer device, if: 

4 the prior determination of a source of a detected acoustic signal portion is 

5 unsure; 

6 the frame is classified as a phone-type frame; and 

7 a predetermined number of most recent prior frames includes phone-type 

8 frames that exceed in number a predetermined phone-type frame count threshold. 

1 91. A system, comprising: 

2 an audio discriminator to determine whether a detected acoustic signal was 

3 generated by a person in speaking; and 
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4 a sourcing signaler to selectively provide a signal indicating that the person 

5 is speaking in response to a determination by the audio discriminator that the detected 

6 acoustic signal was generated by the person in speaking. 

1 92. The system of claim 91, wherein the sourcing signaler is to provide 

2 the signal to participants of a multi-party conference over a network. 

1 93. The system of claim 91, wherein the audio discriminator is a 

2 component of the sourcing signaler. 

1 94. The system of claim 91, wherein the audio discriminator comprises: 

2 a spectrum analyzer to obtain spectrum data from a frame of audio data 

3 acquired from the detected acoustic signal; 

4 a frame classifier to determine which of a set of frame-types the frame 

5 matches, based on the spectrum data obtained by the spectrum analyzer; and 

6 a hidden Markov model (HMM) machine to determine whether the detected 

7 acoustic signal was sourced by an acoustic signal generated by a person speaking using 

8 the classification of the frame-type. 

1 95. The system of claim 91, wherein the audio discriminator comprises; 

2 a spectrum analyzer to obtain spectrum data from a frame of audio data 

3 acquired from the detected acoustic signal; 

4 a frame classifier to determine which of a set of frame-types the frame 

5 matches, based on the spectrum data obtained by the spectrum analyzer; and 
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6 a finite state machine (FSM) 

7 signal was sourced by an acoustic signal 

8 classification of the frame-type . 



to determine whether the detected acoustic 
generated by a person speaking using the 



